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Abstract 

In this work we study several types of diagonal-effect models for 
two-way contingency tables in the framework of Algebraic Statistics. 
We use both toric models and mixture models to encode the different 
behavior of the diagonal cells. We compute the invariants of these 
models and we explore their geometrical structure. 
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1 Introduction 

A probability distribution on a finite sample space X with k elements is a 
normahzed vector of k non-negative real numbers. Thus, the most general 
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probability model is the simplex 



A 



Pi>0 , ^Pi = 1 



(1) 



1=1 



A statistical model M is therefore a subset of A. 

A classical example of finite sample space is the case of two-way con- 
tingency tables, where the sample space is usually written as a cartesian 
product of the form X = {!,...,/} x {!,..., J}. We will consider this case 
extensively in the next sections. 

When Ai is defined through algebraic equations, the model M is said 
to be an algebraic model. In such algebraic and geometric tech- 

niques are useful to study the structure of the model and many statistical 
quantities such as sufficient statistics and maximum likelihood estimators. 
In recent literature this approach is known as "Algebraic Sta t istics" . For 



a survey on this field the r eader can refer to iPistone et al 
Pachter and Sturmfeld toO^ ). 



(2001a) and 



With this point of view, a statistical model is defined as the set of points 
in A where certain polynomials . . . ,Pk), ■ ■ ■ , fiipi, ■ ■ ■ ,Pk) vanish. No- 

tice that the non-negativity and normalization conditions increase the com- 
plexity of the geometrical study. In fact, Algebr a ic Ge ometry usually works 



HarrisI \imi ). while in Algebraic 



in complex projective spaces, see e.g. 
Statistics we have to consider a real affine variety and we must intersect the 
variety with the simplex. 



Pistone et al. ( 


2001b 


) and 


Geiger et al. 



while the us e of A lgebraic Statistics in various applications is presented in 
Riccomagnol ( 20091 ). 



The difference between models with positive probabilities and models 
with non-negative probabilities has been deeply studied in Algebraic Statis- 
tics. When only positive probabilities are involved usually one takes the 
log-probabilities log(pi), . . . , log(pfc). Hence, many statistical models are 
defined by linear equations in the log-probabilities. The most widely used 
models for contin gency ta b les ar e defined in this way and are called log- 



linear models, see Agresti ( 20021 ). The use of polynomial algebra instead 



of linear algebra has led to the study of models with non-negative proba- 
bilities. Therefore, the new class of toric models has been introduced and 
man y geometric proper ties have been related to statistical properties, see 
(I2OO6I ). Toric models gener alize the log-lin ear models to 



e.g 



Geiger et al 



include models with structural zeros, see e.g. iRapallol (Eo3). Exact infer' 
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ence on toric model can be done through MCMC methods based on Markov 
bases. The difference between positivity and non- negativity is also a ma- 
j or iss ue in the computation of Markov bases, see iDiaconis and Sturmfels 
(Il998l ) and IChen et all (l2005l ) where the difference between lattice bases 
and Markov bases is shown to be essential. 

In this paper we consider the diagonal-effect models, i.e., models encod- 
ing a special behavior of the diagonal cells of the table with respect to the 
independence model. It is a class of statistical models for square two-way 
contingency tables with a wide range of applications, from social mobility 
analysis in Psychometry to rater agreemen t analysis i n med ical and phar- 



maceutical sciences, see e.g. lAgrestil ISchusterl tooi ). Some results 



in Algebraic S t a.tistics for this kind of mod els h ave already been discussed i n 
Rapallol (|2nn,^ ). ICarlini and E,ap;iiiol ^200^ ) and lKrampe and Kuhntl tooii ). 



Due to the variety of the applications, this type of statistical models has 
been approached in many different ways and several mathematical defini- 
tions have been introduced, often to describe the same objects. In this 
paper, we will concentrate especially on toric models and mixture models. 

The main aim of this paper is to study the geometric structure of the 
diagonal-effect models, showing the differences between toric models and 
mixture models. In particular, we compute the invariants of these models. 
We recall that an invariant of a model is a polyn omial function vanishing 
in the points of the model, see Garcia et al. ( 20051 ). We show that the toric 
and mixture models differ not only on the boundary of the simplex but also 
in its interior, also when the models have the same invariants. 

In Section [2] we recall some basic definitions and results on toric models, 
with special emphasis on the independence model. In Section [3] we define 
the diagonal-effect models as both toric models and mixture models, we 
show that they have the same invariants, and we describe their structure, 
while in Section H] we study in more details their geometry. Finally, in 
Section [5] we study a special class of diagonal effect models which encodes a 
common behavior of the diagonal cells, i.e., all diagonal cells give the same 
contribution. 

The results presented here also suggest future works on these topics as 
such as: the comparison of two or more diagonal effect models; the study of 
the geometry of more complex models, such as diagonal models for multi- 
way tables or non-square tables; a better understanding of the notion of 
maximum likelihood estimates for this kind of models. 
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2 Basic facts on toric models and independence 



In this paper we consider a two-way contingency table as the joint observed 
counts of two categorical random variables X and Y . Let us suppose that 
the random variable X has / levels, and Y has J levels. Therefore, the 
sample space is the cartesian product X = {!,...,/} x {!,..., J} and the 
observed contingency table is a point / G . 

A probability distribution for an / x J contingency table is a matrix 
P = {Pi.j) such that pij = P(X = i,y = j)- Clearly the matrix P is such 
that pij > for alH = 1, . . . , / and j = 1, . . . J, and jPij = 1. In other 
words, the matrix P is a point of the closed simplex 



A 



P = iPi,j) ■ Pi,j > , "^Pij = 1 



(2) 



A statistical model is a subset of A. In most cases, the statistical model 
is defined through algebraic equations and the model is said to be algebraic. 
A wide class o f algeb r aic statistical r aodels is the class of toric models, see 
Pistone et all (|2001al l. IPistone et all (|2001bl l and lRapallol (|2007l l. 

In a toric model, the raw probabilities of the cells are defined in para- 
metric form as power products through a map (p : M>q — > M>q: 



h=l 



(3) 



Therefore, the structure of the toric model is encoded in an IJ x s non- 
ne gative integer matrix A which extends (pto a vector space homomorphism, 



see 



Pistone et al. Chapter 6. Notice that, in the open simplex 



P = (Pij) ■ Pi,j > , "^Pij = 1 



(4) 



the power product representation leads to a vector-space representation by 
taking the log-probabilities. Moreover, it is known that eliminating the C 
parameters from Equations in ([3]) one obtains the toric ideal 2a associated to 
the statistical model. The ideal Ia is a polynomial ideal in the ring R[p] = 
M[pi^i, . . . ,P7_j] generated by pure binomials. We recall that a binomial 
pO. _ pb p^pg if gcd{p"',p^) = 1. The notation p"' — is a vector notation 

for TT- P-' - TT- p"''- 
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A move for the toric model defined by the matrix A is a table m G Z^"^ 
with integer entries such that A*(m) = 0. The move m is represented in 
the ring M[p] by the pure binomial — where m+ and ra— are the 
positive and negative part of m. 

A Markov basis for the statistical toric model defined by the matrix A 
is a finite set of tables mi, . . . , G Z^"' that connects any two contingency 
tables /i and /2 in the same fiber, i.e. such that = A\f2), with 

a path of elements of the fiber. The path is therefore formed by tables of 
non- negative counts with constant image under . 

The relation between the notion of Markov basis and the toric ideal Ia 
is given in the theorem below. 

Theorem 2.1 ( Diaconis and Sturmfels ( 19981 ). Theorem 3.1). The set of 



moves {mi, . . . , mi} is a Markov basis if and only if the set ^ i 

!,...,£} generates the ideal Ta- 

In many applications this theorem has been used in its "if" part to deduce 
Markov bases from the com putation of a syste m of generators of a toric ideal, 
see e.g. Rapallol (j2003l ) and Chen et al. ( 20061 ). On the contrary, in the next 



section we will make use of Theorem 12.11 in its "only if" implication. 

In this paper the independence model will play a special role. It can be 
considered as the simplest toric model. The variables X and Y are indepen- 
dent if P(X = i,Y = j) = ¥{X = i)F{Y = j), i.e., the joint distribution is 
the product of the marginal distributions. The independence condition can 
be written as: 

Pi,i = Cf\f foralH,j (5) 

(r) (c) 

for suitable Q s and Q s. The non-negativity constraint reflects into 

(r) 

non-negativity of the parameters. Namely, we suppose Q > for all 
i = 1, . . . , /, and cj'^^ > for all j = I, ... J. Using Equation ([3]), the 
independence model is then defined as the set 

M = {P= (pij) : pij = C^'\f , 1 < i < /, 1 < j < J} n A , (6) 

(r) (c) 

for non-negative Q s and Q s. 

Notice that Equation Q implies that the matrix P has rank 1 and 
therefore a probability matrix P in the independence model must have all 
2x2 minors equal to zero. In formulae, it is therefore easy to write the 
independence model in implicit form as: 

M' = {P= (pij) : Pt,jPk,h-Pz,hPk,j = 0, l<i<k<I, l<j<h< J}nA . 

(7) 
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In Diaconis and Sturmfeld ( 19981 ). the authors have studied this set to find 
Markov bases for the independence model, while the corresponding polyno- 
mial ideal has been cons idered in Algebraic G e ometr y in the framework of 
determinantal ideals, see lHosten and SullivantI (|2004l ). 

As the independence model is toric. Lemma 2 in iRapallol (|2007l ') says 
that the model A4 in parametric form and the corresponding model A4' in 
implicit form coincide in the open simplex A>o. 

Proposition 2.2. With the notation above, in A>o we have that A4 = A4' . 
Proof. Using the same notation as above, consider the sets 

{P = : p,,, = Ci'kf , 1 < ^ < /, 1 < J < J} 

and 

{P = {Pi,j) ■ Pi,jPk,h - Pi,hPk,j = 0, 1 <i < k < I, 1 < j < h < J} . 

Taking the log-probabilities, both sets are defined as a linear system and it 
is immediate to show that they define two vector sub-spaces with the same 
dimension. □ 

It is known that A4 and A4' are in general different on the boundary 
A \ A->n . A co mplete description of this issue can be found in Section 4 of 
Rapalloi (|2007l l. 



3 Diagonal-effect models 

As mentioned in the Introduction, diagonal-effect models for square I x I 
tables can be defined in at least two ways. In the field of toric models, one 
can define these models in monomial form as follows. 

Definition 3.1. The diagonal-effect model Aii is defined as the set of prob- 
ability matrices P € A such that: 



and 



Pid = Q Cj for ty^J 



P^. = Ct^C^C^^ for ^ = J 



■>« 

where C^^^ and (^^'^^ are non-negative vectors with length I. 



(8) 
(9) 
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In litera t ure, s uch a model is also known as quasi-independence model, 



Agrestil (|2002l ). As the model in Definition 13.11 is a toric model, it is 



see 

relatively easy to find the invariants. Eliminating the parameters C^'^^ 
and C^^^ one obtains the following result. 

Proposition 3.2. The invariants of the model A4i are the binomials 

Pi,jPi',j' - Pi,j'Pi',j (10) 
for i, i' , j, j' all distinct, and 

Pi,i'Pi',i"Pi",i - Pi,i"Pi" ,i'Pi\i (11) 

for i, i! , i" all distinct. 

Proof. In Aoki and Takemura ( 20051 ). it is shown that a minimal Markov 
basis for the model Mi \s formed by: 

• The basic degree 2 moves: 



3 3 
i +1 -1 
i' -1 +1 



with i, i', J, j' all distinct, for / > 4; 
• The degree 3 moves of the form: 



i 


i' 


i" 





+1 


-1 


-1 





+1 


+1 


-1 






with i, z', i" all distinct, for / > 3. 

Thus, using Theorem 12.11 the binomials in Equations [10] and [11] form a set 
of generators of the toric ideal associated to the model M.i. □ 

Remark 3.3. To study the geometry of the model with structural zeros 
on the main diagonal it is enough to consider the variety defined by the 
polynomials in Proposition \3.S\ and intersect it with the hyperplanes {pi^i = 
0} for all i. 
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In the framework of the mixture models, the diagonal-effect models have 
an alternative definition as follows. 



Definition 3.4. The diagonal- effect model M.2 is defined as the set of prob- 
ability matrices P such that 



p = acr* + (1 - a)D 



(12) 



where r and c are non-negative vectors with length I and sum 1, D = 
diag((ii, . . . ,dj) is a non-negative diagonal matrix with sum 1, and a E [0, 1] . 

Remark 3.5. Notice that while in Definition {3^1 the normalization is ap- 
plied once, in Definition \3.4\ the normalization is applied twice as we require 
that both or* and D are probability matrices. This difference will be partic- 
ularly relevant in the study of the geometry of the models. 

First, we study the invariants and some geometrical properties of these 
models, then we will give some results on their sufficient statistics. 

Theorem 3.6. The models Aii and M.2 have the same invariants. 

Proof. Writing explicitly the polynomials in Equations ([8]) and ([9]) it is easy 
to check that each c!f^ appears in only one polynomi al. The same for each d. 



i n Equ ations (jl2p . Thus, following Theorem 3.4.5 in Kreuzer and Robbianol 
such polynomials are deleted when we eliminate the indeterminates 



Cf^^'s and di's. 

As the remaining polynomials, corresponding to off-diagonal cells, are 
the same in both models, the models A^i and M2 have the same invariants. 

□ 

In order to study in more details the connections between M.i and A^2 we 
further investigate their geometric structure. The non-negativity conditions 
imposed in the definitions imply that Mi 7^ M2 and neither M2 C Mi nor 
Ml C M2. We can show this by two easy examples. 

First, let (^"^^ and C^'^^ respectively the vectors, of length /, (j, j, . . . , j) 
and (jTii 7^1) • • • 1 7^1) and define (^^'^ as the zero vector. Thus, the prob- 
ability table we obtain in toric form is: 



P 




1 







7(i-T) 
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Such probability matrix belongs to Ali by constructions, while it does not 
belong to A^2- In fact, i = in Equation would imply either a = 
(a contradiction, as P is not a diagonal matrix), or C,[^^ = (a contradiction, 



.(c) 



as P has not the first row with all O's), or Q 
has not the first column with all O's). 

On the other hand, let P be the diagonal matrix 



(a contradiction, as P 



P 



7 








1 
I 



\0 








Such probability matrix belongs to M.2 by setting a = and D = diag( j, . . . , j), 
while it does not belong to A^i. To prove this it is enough to note that 
Pi 2 = would imply either Q = (a contradiction, as the first row of P 
is not zero), or (!^2^^ = (a contradiction, as the second column of P is not 
zero) . 

Nevertheless, in the open simplex we can prove one of the inclusions. 



Proposition 3.7. In the open simplex A 



M2 C Ml 



(13) 



Proof. In fact, let us consider a probability table in A^2i given by P = 
acr* + (1 - a)D. As P e A>o, a 7^ 0, / for alH = 1, . . . , / and cj / 
for all J = 1, . . . , /. Then we can describe P as an element of A^i in the 



following way. We define C, 



(r) 



(c) 

for all i and Q = ctCj, for all j. After 



that, it is enough to find the diagonal parameters by solving the equations 



(7) 



arid + (1 - a)di 



that is, as a 7^ 0, rj 7^ 0, and Cj 7^ 0, we have 



'(7) 



1 + 



(1 - a)di 
arid 



□ 



Moreover, in the open simplex A>o, the inclusion in Proposition 13.71 is 
strict. Let us analyze the probability matrices in the difference Aii \M2- 
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Consider three vectors C^*^^ = (Ci , • • • , C/ )> C^'^ = iCi , ■■■Xj) and 
(^(7) = {C['^\ . . . , Cj"'^)- Using these vectors, we define the probability table 
P as in Definition 13.11 and then we normalize it, i.e. dividing by Nt = 
E^^.C^Cf + E.=,Crc]^Vi"^ Define also N = E,,Cf^Cf (which can 
be seen as the normalization of the toric model when is the unit vector, 
i.e., it is the vector with all components equal to one). 

We want to find three vectors c = (ci,...,c/), r = (ri,...,r/), d = 
{di, . . . , dj), with E ~ E '^i ~ E '^i = 1 and a scalar < a < 1 such that 



a 



Ar)Ac) 

/rici riC2 riC3 
r2Ci r2C2 r2C3 
r-3Ci r3C2 r3C3 



\r/ci riC2 rjC3 



Ar)Ac) 
S3 '=2 



S2 

r2Ci 
rsci 



Ar)Ac) 
SI S3 

''2 ''3 



52 S/ 

53 S/ 



^('')^(c) 
S/ S3 



+ (1 - a) 



r/c// 



(di 

^2 

(is 

Vo 



0\ 





di) 



(14) 

We start studying the off-diagonal elements. Consider first the case 



(r) (c) 

Nt > N. Thus we have < 
only possible choice is given by 



' and < 1. In this situation the 



a 



N 



-(c) 



EC 



(r)' 



EC 



(15) 



In fact, recalling the definition of A^, we have 



ariCj 



Ar)Ac) 



(16) 



for all i, j with i ^ j. Taking the log- probabilities, we obta in a linear system. 
It is easy to prove, as in Chapter 6 of lPistone et al\ (j2001al ). that the rank of 
this system is equal to (21—1). Hence, considering the normalizing equations 
for r and c, we see that the solution in ([15]) is unique. 
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Let us consider the generic equation of the i-th diagonal element: 

Ct^^Ct'^ = ^nc, + (1 - a)d, . 
After substituting the previous values for r^, q and a we get 

- Nt N ^ Nt ' ■ 

(r) (c) 

As we consider matrices in A>o, the quantity Q Q is different from zero. 

(r) (c) 

Therefore, after multiplying for Nt and dividing by Q Q we obtain 
that is 

Ar)Ac) 

Thus we see that the P (z A4i\M2 when A'^j- > A^ and there exists at least 
an index i such that Ci"'^ < 1- 

When Nt = N, from Equations (|16|) we obtain a = 1. Therefore in 
Equation (jl4p the matrix on the right hand side has rank 1, and this implies 
that P G A/(2 if and only if (^'^^ = 1 for ah i. 

Consider now the case Nt < N. Hence we have ^ > ' and 

> 1. Again the only possible choice for the off-diagonal elements would 
be given by 



a = — ri = 7-T, c 



but in this case a = > 1. Thus we conclude that all P G A^i with 
Nt < N are in Aii \M2- This leads to the following result. 

Theorem 3.8. Let P G ^Al H A>o a strictly positive probability table 
given by the vectors C^'^^ = {Ci\- ■■,CP), &^ = {Ci\ ■■■,&^) and (^^^ = 
(C{"\...,C}")). Define Nt = E,,^, Cf \f + E,,=, Cf\f Ci"^ and N = 
Ei j Ci'^^Cj'^^ ■ Then P G Mi \ M2 if one of the following situations holds: 

(i) Nt < N; 

(ii) Nt = N and there exists at least an index i such that (^^"'^ ^ 1; 



11 



(iii) Nt > N and there exists at least an index i such that < 1. 

We conclude this section with a result on the sufficient statistics for the 
models M.\ and M.2- 

Proposition 3.9. For an independent sample of size n, the models Mi and 
M2 have the same sufficient statistic. 

Proof. In fact, let / = {fij) be the table of counts for the sample. The 
likelihood function for the model in toric form is 

i j i 

where and are the row and column marginal totals, respectively. 
This proves that the marginal totals together with the counts on the main 
diagonal are a sufficient statistic. With the same statistic we can also write 
the likelihood under the mixture model M.2'. 

L2{r, c, d, a; /) = H^'i^j' = H^^^^^j)^"' H^"^^^* + ~ = 
= «{n-E. Y[ J] JJ^arid + (1 - a)di)f^-' . 

i j i 

□ 



A geometric description of the diagonal-effect 
models 



In this section, we try to describe the models we studied using some ge- 
ometric flavor. This analysis will also shed some light on the elements in 
Ail \ M2- We use v ery bas i c and clas sic geometric idea s and facts. As 
references, we suggest iHarrisI (Il992l ) and iHartshornd (119771 ). 

We start with the model Mi. The basic object we need is the variety 
V describing all / x / matrices having rank at most one. When we fix 



(7) 



l,i = 1, ... ,7 the parametrization in dS]) and ([9]) is just describing 



(c) (r) (7) 

V. Hence, fixing values for all the Q s and the Q s and setting Q = 

(7) 

l,j = 1, . . . , / we obtain a point M ^ V. Now, if we let Q to vary we 
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are describing a line passing through M and moving in the direction of the 
vector (0, . . . , 1, . . . , 0), where the only non zero coordinate is the (/, /)-th; 
the set of all these lines is a cylinder. Now we set Q'f^ = aC, and Cm^ = bC, 
for fixed reals a and b. When we let Q vary, we are now describing a cylinder 
with directrix parallel to the line of equations bpi^i — apm,m, Pi,j = for 
ihj) 7^ {l,l),{m,m). The same argument can be repeated fixing linear 
relations among the diagonal elements. In conclusion, we can describe M.i 
as the intersection of the simplex with the union of cylinders having base V 
and directrix parallel to the directions given by diagonal elements. 

We now use the join of two varieties, i.e. the closure of the set of all 
the lines joining a point of any variety with any point of another variety. In 
order to do this, we also need to consider W the variety of diagonal matrices. 
Then M2 is the union of the segment joining a point of fl A with a point 
of VFn A, i.e. a subvariety of the join of V and W . Each of this segment lies 
on a line contained in one of the cylinder we used to construct Mi. Hence 
we get again the inclusion C A4i in A. 



5 Common-diagonal-efFect models 

A different version of the diagonal-effect models are the so-called common- 
diagonal-effect models. The definitions are as in the models above but: 

• The vector C,^"^'^ is constant in the toric model definition; 

• The matrix D is diag(j, . . . , 7) in the mixture model definition. 

This kind of models is much more complicated than the models in 
Section [3l Just to have a first look at these models, we note that for 
/ = 3 the diagonal-effect models have only one invariant. For the common- 
diagonal-effec t mod els, we have computed the invariants with CoCoA, see 



CoCoATeamI (j2007l ). for / = 3 and we have obtained the following lists of 
invariants. 

For the toric model we obtain 9 binomials: 

Pl,2P2,3P3,l - Pl,3P2,lP3,2 , 

^•1,3^2,2^3,1 - Pl,lP2,3P3,2 , 
-P1,1P2,3P3,2 + Pl,2P2,lP3,3 , 
-P2,2P2,3Pi,l + pI,iP3,2P3,3 , 

PI,2P2,2pI,1 - P1,IP2,IpI,2 ' 
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-^1,1^^1,3^3,2 +Pl,2P3,m,3 > 
-Pl,3P2,2PS,2 + pI,2P2,3P3,3 , 
-Pl,lP2,3P3,l + P1,3pI,iP3,3 , 
pI,3P2,IP2,2 - P1,1P1,2PI,3 ■ 

For the mixture model we obtain: 

• 1 binomial 

Pl,2P2,3P3,l - Pl,3P2,lP3,2 

• 12 polynomials with 4 terms 

Pl,3P2,lP2,2 - Pl,2P2,lP2,3 + Pl,3P2,3P3,l - Pl,3P2,lP3,3 , 

-Pl,2Pl,3P2,2 + Pl,2P2,3 - Pl,3P3,2 + Pl,2Pl,3P3,3 , 
Pl,3P2,lP3,l - Pl,lP2,3P3,l + P2,2P2,3P3,1 - P2,lP2,3P3,2 , 
Pl,2Pl,3P3,l - Pl,lPl,3P3,2 + Pl,3P2,2P3,2 - Pl,2P2,3P3,2 , 
Pl,3p\l - Pl,lP2,lP2,3 - P2,3P3,1 + P2,lP2,3P3,3 , 
Pl,3P2,l - Pl,m,3P2,3 + Pl,3P2,2P2,3 - P1,2pI,3 ' 
Pl,2Pl,3P2,l - Pl,lPl,2P2,3 - Pl,3P2,3P3,2 + Pl,2P2,3P3,3 , 

-P2,1P2,2P3,1 - ^'2,3^1,1 + pi,lP3,2 + P2,lP3,lP3,3 , 
-Pl,2P2,2P3,l + Pl,2P2,lP3,2 - Pl,3P3,lP3,2 + Pl,2P3,lP3,3 , 

Pl,2P3,l - Pl,lP3,lP3,2 - P2,2P3,lP3,2 - P2,1pI,2 > 
Pl,2P2,lP3,l - Pl,lP2,lP3,2 - P2,3P3,lP3,2 + P2,lP3,2P3,3 , 
Pl,2P3,l - Pl,lPl,2P3,2 - Pl,3P3,2 + Pl,2P3,2P3,3 ] 

• 6 polynomials with 8 terms 

Pl,lPl,3P2,2 - Pl,3P%2 - Pl,lPl,2P2,3 + Pl,2P2,2P2,3+ 

+ Pl,3P3,l - Pl,3P2,3P3,2 - Pl,lPl,3P3,3 + Pl,3P2,2P3,3 , 

P1,1P1,3P2,1 - pIiP2,3 - Pl,2P2,lP2,3 + Pl,lP2,2P2,3 + 

+ pi,3P3,2 - P1,3P2,1P3,3 + Pl,lP2,3P3,3 " 1^2,2^2,3^3,3 , 
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- Pl,lP2,2PZ,l +P2,2P3,1 -Pl,3P3,l + Pl,lP2,lP3,2 + 

-P2,lP2,2P3,2 +P2,3P3,lP3,2 +Pl,lP3,lP3,3 - P2,2P3,lP3,3 , 



Pl,lPl,2P3,l -Pl,lP3,2 - Pl,2P2,lP3,2 + Pl,lP2,2P3,2 + 

+ P2,3P3,2 - Pl,2P3,lP3,3 + ?'l,l?'3,2P3,3 " P2,2P3,2P3,3 , 



Pl,2ph - P1,1P2,1P2,2 - P1,1P2,3P3,1 - P2,1P2,3P3,2 + 

+ Pl,lP2,lP3,3 + P2,lP2,2P3,3 + ?'2,3P3,lP3,3 " ?'2,1?'3,3 ' 



Pl,2P2,l - Pl,lPl,2P2,2 - Pl,lPl,3P3,2 " ?'l,2P2,3P3,2 + 

+ Pl,lPl,2P3,3 + Pl,2P2,2P3;3 + ?'l,3?'3,2?'3,3 " P1,2pI,3 i 

• 1 polynomial with 12 terms 

Pl,lPl,2P2,l -pl,lP2,2 -Pl,2P2,lP2,2 +Pl,ipi,2 + 

- ?'1,1?'1,3?'3,1 + P2,2P2,3P3,2 + P?,lP3,3 " ^2,2^3,3 + 

+ Pl,3P3,lP3,3 - P2,3P3,2P3,3 " ?'l,l?'3,3 + ?'2,2?'i,3 • 

In the case of toric models, the invariants can be char acterized theoreti - 
cally. In fact, also in this Markov basis is known. In lHara et al\ (|2008l ^ 



it is shown that a Markov basis for this toric model is formed by 6 different 
types of moves. We need the 2 types of moves for the diagonal-effect model 
plus the moves below: 

• The degree 3 moves of the form: 



I 
i' 



i 


i' 


i" 


i 




i" 


i 


i' 


i" 


+1 





-1 


i +1 


-1 





i 


-1 


+1 





-1 


+1 


i' -1 





+1 


i' -1 


+1 





-1 


+1 





i" 


+ 1 


-1 


i" +1 





-1 



with i, z', i" all distinct, for / > 3; 
• The degree 3 moves of the form: 
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i i' j 

i +1 -1 

i' -1 +1 

/ -1 +1 

with i, i' , j, j' all distinct, for / > 4. 

• The degree 4 moves of the form: 

i i' j 
i +1 +1 -2 
i' -1 -1 +2 

with i, i', j all distinct, for / > 3, and their transposed. 

• The degree 4 moves of the form: 

i i' j j' 
i +1 +1 -1 -1 
i' -1 -1 +1 +1 

with i, z', J, j' all distinct, for / > 4, and their transposed. 

Therefore, as in Proposition 13.21 we can easily derive the invariants. We 
do not write explicitly the analog of Proposition 13.21 for common-diagonal- 
effect models in order to save space. 

The study of the common-diagonal-effect models in mixture form is much 
more complicated. In fact, notice that in the computations above, the mix- 
ture model present invariants which are not binomials. However, some par- 
tial results can be stated. 

Theorem 5.1. (a) Fori, j, k, I all distinct we define 

bijkl = Pi,jPk,l — Pi,lPk,j ; 

(b) For i, j, k, all distinct we define 

^ijk ~ Pi,jPj,kPk,i Pi,kPk,jPj,i j 

(c) For {i,j) and {k,l) two distinct pairs in {1,...,/} with i ^ j, and 
k 7^ / and m € {1, . . . , I}\{i, j} and n G {I, . . . , I}\{k,l} with m ^ n 
we define 

fijklmn ~ Pi,jPk,lPn,n ~ Pi,jPn,lPk,n ~ Pi,jPk,lPm,m ~l~ Pk,lPm,jPi,m j 
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(d) for two distinct indices i and j in {1, ... , 1} and for /c € {1, ...,/} \ 
{i,j} we define 



9ijk =Pi,jPi,iPk,k + Pi,jPj,jPk,k - Pi,jPi,iPj,j + Pi,jPk,kPk,k+ 
+ Pk,kPi,kPk,j - Pi,iPi,kPk,j + Pi,jPj,i - Pi,jPk,jPj,k ; 

(e) For i, j, k, all distinct we define 

hijk =Pi,iPlj + PliPk,k + Pj,jPl,k - PliPj,j - p'j,jPk,k - Pi,iPl,k + Pi,iPi,jPj,i + 

- Pi,iPi,kPk,i + Pj,jPj,kPk,j - Pj,jPj,iPi,j + Pk,kPk,iPi,k - Pk,kPk,jPj,k ■ 

Then the previous polynomials are invariants for the common- diagonal- 
effect models in mixture form. 

Proof. Cases (a) and (b) follow from Proposition 13.21 since the off-diagonal 
elements of the probability table are described, up to scalar, in the same 
monomial form as for the elements of A^i. 

For case (c), consider the term gi = pijPk,iPn,n in fijkimn- This gives 
two monomials: a^riCjrkCirnCn and a'^riCjrkCi{l — a)d, where d = l/I. The 
term —g2 = —PijPn,lPk,n of fijkimn cancels the first monomial of gi. In fact 
—Pi,jPn,iPk,n = ot'i'iCjT^nCi'''kCn- Since in 52 there are not diagonal variables, 
we need another term in order to cancel the second monomial of gi. Thus 
we subtract, to gi — g2, a term of the form g^ = PijPk,iPm,m which gives the 
monomials — a^rjCjrfcQ(l — a)d and —a^riCjrkCirmCm- To cancel this last 
monomial it is enough to add the term 5(4 = Pk,iPm,jPi,m = Oi^fkCirraCjriCm- 
Thus fijkimn = 91—92 — 93 + 94: Vanishes on the entries of a probability table 
of the mixture model with common diagonal effect. 

For case (d) , consider first the terms with pairs of variables on the diag- 
onal. 



Pi,jPi,iPk,k =a^rfrkCiCjCk + a^rfacjd - a^rfciCjd+ a'^nrkCjCkd 



+ 



-|- ariCjd — 2a riCjd — a r^rkCjCkd + a rtCjd ; 



Pi,jPj,jPk,k =a^rirjrkCjCk + a^rirjC^d - a^rirjCjd + a^rirkCjCkd+ 



+ ariCjd^ — 2a^riCjd^ 



a^rirkCjCkd 



+ a^riCjd^ ; 



Pi,jPi,iPj,j =Oi r- rjCiCj + a r^ CiCjd - a^^r^ CiCjd + a rirjCjd+ 

r\ Q Q o (1 Q O 

+ ariCjd —2a ViCjd —a rirjCjd + a riCjd ; 
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Pi,jPl,k =(^^rirlcjcl + 2a^rirkCjCkd - 2a^rirkCjCkd + ariCj(f+ 
— 2a'^riCj(f + a^riCjd^ . 



It is easy to see that while some terms, such as a^riCjd?, are simply 
cancelled considering the difference of two monomials, other terms, such 
as the boxed ones, appear in different monomials. However, they appear 
with the appropriate coefficients and considering PijPi^iPk,k + Pi,jPj,jPk,k ~ 
Pi,jPi,iPj,j ~ PijPk k cancel most of them. In fact we obtain 

a^rfrkCiCjCk - a^rirlcjcl - a^rfrjcja + a^rirjrkcjck ■ 

The only way to cancel the term —a^rir'^Cjc'l is to add the monomial 
Pi,kPk,jPk,k = a^nrlcjcl+a'^rirkCjCkd-a^rirkCjCkd. However this monomial 
adds two more terms that can be cancelled by using another monomial with 
a variable in the diagonal, that is Pi^iPi^kPkj = cc"r'jrkCjCjCk + a^rirkCjCkd — 
a^VirkCjCkd. After that, the only two missing terms are —a^rfrjCjCi + 
a^riVjrkCjCk which can be cancelled by adding PijPj,i — Pi,jPk,jPj,k- 

For the case (e), we omit the complete details of the proof. One has 
to proceed as in cases (c) and (d) considering separately Pi,ip'j j +pjiPk,k + 
Pj,jPl,k ~ PliPjd ~ P\jPk,k - Pi,iPk,k contributions of Pi,iPi,jPj,i - 

Pi,iPi,kPk,i, PjjPj,kPk,j - Pj,jPj,iPid and Pk,kPk,iPi,k - Pk,kPk,jPj,k- □ 

With some computations with CoCoA, we have found that the poly- 
nomials defined in Theorem 15.11 define the model M2 for / = 3, 4, 5. We 
conjecture that this fact is true in general. 
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